LFDA: A Probabilistic Graphical Model for the Study of Excitation Emission Matrices
Abstract
Dissertation at the University of Miami, supervised by Professor Miroslav Kubat. Number of pages in text: 108.
Traditional classification techniques assume that samples are described by vectors of features. In some domains, however, samples are gathered by measuring one variable with respect to two or more others: for given values of x and y, measure z. In such domains, samples are more naturally described by matrices or by higher-dimensional arrays. We present a novel latent Dirichlet allocation (LDA)-based approach for modeling and analyzing fluorescence spectroscopy excitation-emission matrices (EEMs) and other three-way datasets. We draw parallels between topic modeling and three-way arrays that allow us to adapt LDA-based methods to latent fluorophore studies. The proposed framework views the EEMs as being generated from an underlying hidden pool of fluorophore compounds and provides a latent fluorophore-space representation of an EEM. We show that this LDA-based model can increase classification performance, especially when paired with parallel factor analysis (PARAFAC), arguably the most widely used tool for analyzing EEMs. Our experiments show that the proposed LDA-based algorithm is in some cases more robust than PARAFAC to certain types of noise and data disturbances. We also observe that pairing this LDA-based method with PARAFAC improves classification performance and adds robustness at high peak signal-to-noise ratio (PSNR) values. Finally, we present an extended graphical model that incorporates the effect of outside variables that may affect the fluorescent expression of certain compounds. The extended model offers further insight into the interaction between these variables and the latent fluorophore components while facilitating the model-building process.
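The parallel between topic modeling and three-way arrays can be illustrated with a minimal sketch. It assumes, hypothetically, that each EEM plays the role of a document, each excitation-emission wavelength pair plays the role of a word, and quantized intensity plays the role of a word count. The abstract does not spell out the dissertation's actual mapping, so the function name `eem_to_counts` and the `levels` parameter are illustrative assumptions only.

```python
def eem_to_counts(eem, levels=100):
    """Flatten one EEM into a bag-of-words style count vector.

    eem    : list of rows (one per excitation wavelength), each a list of
             intensities (one per emission wavelength).
    levels : hypothetical quantization resolution; the brightest cell of
             the EEM is mapped to `levels` counts.
    """
    # Clip negative readings to zero, since counts cannot be negative.
    flat = [max(0.0, value) for row in eem for value in row]
    peak = max(flat)
    if peak == 0:
        return [0] * len(flat)
    # Scale relative to the peak and round to integer pseudo-counts.
    return [int(round(levels * value / peak)) for value in flat]


# Two toy 2x2 EEMs (rows: excitation, columns: emission); made-up numbers.
eems = [
    [[0.0, 1.0], [2.0, 4.0]],
    [[3.0, 0.0], [0.0, 1.5]],
]

# "Document-term" matrix: one row per EEM, one column per wavelength pair.
doc_term = [eem_to_counts(eem, levels=8) for eem in eems]
print(doc_term)
```

A count matrix of this shape is the kind of input that off-the-shelf LDA implementations expect; the per-EEM topic proportions they return would then correspond to the latent fluorophore-space representation described above.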
The performance of machine learning algorithms is known to be impaired if the representation of the individual classes in the training set is imbalanced, i.e., if one class outnumbers the other class(es). Such is the case for several experiments in this proposal. Many approaches to this problem have been developed, none of them fully satisfactory. Here we propose membership-based minority oversampling (MeMO) as another possible solution and explore experimentally the conditions under which it outperforms earlier attempts. Finally, we introduce a Dempster-Shafer-based fusion model intended to adaptively merge the PARAFAC and LDA-based models when their outputs are used for classification.
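For readers unfamiliar with Dempster-Shafer fusion, the sketch below shows the standard Dempster rule of combination for two mass functions. It is not the dissertation's adaptive fusion model, whose details the abstract does not give. The class labels and the mass values assigned to the PARAFAC-based and LDA-based classifiers are made-up illustrative numbers.

```python
def dempster_combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function is a dict mapping a frozenset of class labels
    (a subset of the frame of discernment) to its belief mass.
    """
    combined = {}
    conflict = 0.0
    for set_a, mass_a in m1.items():
        for set_b, mass_b in m2.items():
            overlap = set_a & set_b
            if overlap:
                combined[overlap] = combined.get(overlap, 0.0) + mass_a * mass_b
            else:
                # Mass assigned to contradictory hypotheses.
                conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("Sources are in total conflict; rule is undefined.")
    # Renormalize so the remaining masses sum to one.
    return {subset: mass / (1.0 - conflict) for subset, mass in combined.items()}


A = frozenset({"class_a"})
B = frozenset({"class_b"})
EITHER = A | B  # mass here expresses ignorance between the two classes

# Hypothetical evidence from the two models for one test sample.
m_parafac = {A: 0.6, B: 0.1, EITHER: 0.3}
m_lda = {A: 0.5, B: 0.2, EITHER: 0.3}

fused = dempster_combine(m_parafac, m_lda)
print(fused)
```

Representing subsets as frozensets keeps the intersection step a one-line set operation; mass left on the full set lets a classifier express uncertainty rather than forcing a hard vote.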
Similar resources
Combined Unfolded Principal Component Analysis and Artificial Neural Network for Determination of Ibuprofen in Human Serum by Three-Dimensional Excitation–Emission Matrix Fluorescence Spectroscopy
This study describes a simple and rapid approach to monitoring ibuprofen (IBP). Unfolded principal component analysis-artificial neural network (UPCA-ANN) and excitation-emission spectra obtained by spectrofluorimetry were combined to develop a new model for the determination of IBP in human serum samples. Fluorescence landscapes with excitation wavelengths from 235 to 265 nm and emission...
MCR of the quenching of the EEM of fluorescence of Aflatoxins (B1, G1) by Gold nanoparticles
In this research, gold nanoparticles were synthesized and functionalized with the antibody of aflatoxins. The quenching of the fluorescence excitation-emission matrices (EEM) of two types of aflatoxins (B1, G1), induced by the gold nanoparticles, was studied by principal component analysis (PCA) and multivariate curve resolution with alternating least squares (MCR-ALS). These aflatoxins show q...
Rule-based joint fuzzy and probabilistic networks
One of the important challenges in graphical models is dealing with uncertainty. Among graphical networks, the fuzzy cognitive map can model only fuzzy uncertainty, and the Bayesian network can model only probabilistic uncertainty. In many real problems, we face both fuzzy and probabilistic uncertainties. In these cases, the propo...
Publication date: 2016